
SUBSTITUTE SPECIFICATION 



SYSTEM AND METHOD FOR PROFIT MAXIMIZATION 
IN RETAIL INDUSTRY 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

5 This invention relates to systems and methods that use point-of-sale data for 
optimization of prices and promotion schedules for groups of interrelated products 
with the purpose of maximization of a preferred merchandising figure of merit like 
revenue, profit, etc. 

2. Background and State of the Art 

10 Businesses commonly and supermarket chains in particular, use various promotional 
and pricing schemes to improve revenues, profits and sales volumes. For instance, a 
supermarket manager may adjust prices to encourage sales of particular products. 
Also, he may specially present or advertise the products to increase consumer 
awareness and demand. Because of the variety of promotion discounts often applied 

15 at any one time, the complexity of the market, and difficulties in understanding of 
customer response, it is usually very difficult to accurately forecast effectiveness of 
various promotional tools and price adjustments, and to evaluate the influence of 
each selected tool on the overall sales revenues. It may still be even more difficult to 
plan various marketing campaigns and pricing adjustments beforehand to achieve 

20 the overall business optimality. Since predicting and assessing the consequences of 
various pricing and promotion strategies in a reliable manner is beyond the 
capabilities of the unaided human mind, there have been developed a number of 
approaches and computerized systems for dealing with the arising problems. 

Commercially available inventory management systems such as the Maxagrid system 
25 include a yield management system which produces a pricing forecast used to 
determine prices for sales based on factors such as past trends and performance 
data which are updated periodically in order to maintain an accurate pricing model. 
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The user making promotion and pricing decisions is faced with the following 
problems: connecting various promotion tools and prices with resulting sales in the 
past (for which construction of statistical models is necessary), isolating additional 
factors (many of which may be outside of decision-maker's control) that could 
influence sales volumes, developing statistical tools for predicting future demands, 
optimizing model parameters that will render optimal sales, performing necessary 
adjustments in promotions and prices. 

The range and severity of the accompanying problems could be glimpsed from the 
following incomplete list: 

1. Historical data available is usually insufficient, incomplete and often may 
appear contradictory. Relying on older data improves observations to 
parameters ratio but raises the questions of data relevancy for the current 
situation. Missing and outlying observations do not make life easier. 

2. Isolating useful observable relevant factors, beyond promotions and 
prices, is a difficult task in itself. 

3. While prices make a single most important factor, promotion effects are 
much less pronounced, interdependent, sometimes contradictory and 
confusing. 

4. Especially complex are promotion effects for products with substitution 
demand and complementary demand. For example, offering one product 
at a discount might cannabalize sales from another product, and 
ultimately fail to yield greater revenue or greater profit. It is even more 
difficult to evaluate effects of promotions that pair more than one product. 

5. Clearance sales of selected products, so often employed by department 
managers, may effect demand for substitute products in unpredictable 

ways. 

6. The estimated optimal values of prices may be incompatible with overall 
management goals and therefore unacceptable. 



MAK-106US - 3 - 



The system and method in the present invention have been constructed with explicit 
purpose of solving these and many other problems arising in the process of 
optimizing joint pricing and promotion influences on the overall figure of merit, such 
as revenue, gross margin ,etc. Detailed description of our invention is given in the 
5 following sections, while in the remainder of this section we list most relevant 
existing inventions, and pinpoint their major differences from our invention. 

US Patent 5,596,493 (Tone, et al., 1997): It is an early work describing a complete 
statistical demand model suitable for prediction of sale amounts based on historical 
database. Log-normal linear regression model of demand is used. Besides demand 

10 prediction, regression model is used for purposes of determining safe stock, 
restocking, etc. Suggested possible relevant regressors include days of week, 
weather, prices and various categories of advertisements. The problem of selection 
of relevant regressors is addressed, as well as probabilistic measures of model 
uncertainty due to numerous sourced of error. The method for ordering for 

15 restocking is proposed that comprises the steps of entering POS data, finding sale 
amount data of individual goods for a predetermined time period obtained on the 
basis of the POS data, calculating basic statistical values of daily sale amounts of the 
individual goods based on the sale amount data of the individual goods, classifying 
the basic statistical data into one of plural class types, estimating the sale amount in 

20 accordance with the class types and calculating the amount of a restocking order 
based on the estimated sale amount and an amount on stock. The system for 
classifying sale amount characteristics comprises means for entering POS data, 
means for finding sale amount data of individual goods for a predetermined time 
period obtained on the basis of the POS data, means for calculating basic statistical 

25 values of daily sale amounts of the individual goods based on the sale amount data 
of the individual goods, and means for classifying the basic statistical data into one 
of preset plural class types. 

Like most later patent applications, the model in our invention uses some of ideas of 
the present invention while making considerable additions and refinements, and 
30 developing sophisticated statistical tools for comparative assessment of influences of 
various factors on demand levels. For instance, our system is capable of direct 
estimation of promotion effects of various clips run on in-store monitors, and of 
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estimating comparative efficiency of those clips. Also, it integrates influences of 
pricing and promotion into a single system and uses it for optimization of both prices 
and promotion schedules simultaneously. 

US Patent 5,712,985 (Lee , et al., 1998) is a continuation in part of U.S. Patent 

5 5,459,656. The invention provides a system and method for analyzing business 
demand which incorporates tracking of past business demand for a plurality of 
products or tasks, time intervals during the day, and providing improved projection 
of business demand for such items. The system uses the concept of a business 
influence to aggregate, store, access, and manipulate demand data for the purpose 

io of forecasting future demand levels for one or more business items. A business 
influence is any type of quantifiable factor that produces a variation in demand for 
some type of business item. The business influences model is composed of three 
distinct entities: a base profile, at least one influence profile, and a forecast profile. 
The base profile, influence profiles, and forecast profiles are data storage structures 

15 that persistently maintain their associated profile information in selected files in a 
database. The profiles are time-demand curves where demand is represented as 
either quantity or percentage units. An influence profile reflects the changes in 
demand for a business item due to a particular identifiable condition, such as the 
weather, or a sale, or the like. Influence profiles are selected and combined with the 

20 base profile to create a forecast profile. The base profile and influence profiles are 
demand curves representing a particular level of demand for a business item in each 
of the number of time intervals. Seasonality influence profiles may also be created to 
represent the influence of long-term seasonal influences. The forecast profile is a 
projection of anticipated demand for a business item based on its base profile and 

25 any selected influence profiles or seasonality profiles, for a selected period. The 
selected period may be any useful time period, such as a business quarter, month, 
week, day, hour, minute, and so forth. In order to project demand then, a base 
profile for a selected business item is combined with any number of influence profiles 
to create a forecast profile. The user selects a business item to be produced or 

30 scheduled during some time interval. The selection of a business item, and 
subsequent forecasting may be repeated for multiple business items. The user 
selects a base profile for the business item and any number of influence profiles. In a 
preferred embodiment the business item is associated with a base profile, and 
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selected influence profiles, so that selection of the business item results in automatic 
selection of the profiles. In a preferred embodiment, each business day is associated 
with at least one base profile and influence profile that captures the variations in 
demand patterns which effect each demand for a business item associated with the 
5 base profile. In the preferred embodiment, the base profile stores a historical 
exponentially smoothed average of actual demand for the item in each of a variable 
number of time intervals. In alternative embodiments, the base profile stores a 
moving average of actual, a forward trend average, or other types of historical 
averages. 

10 From the given description of the invention, it is not quite clear how it differs from a 
standard statistical approach of setting-up a regression model, estimating model 
parameters, and updating the whole process as new data arrive. The regression 
model need not be linear: it may be a locally weighted regression, or a generalized 
additive model, or whatever. Such an approach adopted in our own patent 

15 application would use well-developed tools for weighting, updating, estimating 
statistical properties of the available data, etc. It could work automatically, i. e. 
without user's intervention, or provide an opportunity for interaction. 

US Patent 5,987,425 (Hartman, et al.1999) describes a variable margin pricing 
system that generates retail prices based on price sensitivity and cost, and allows 

20 dealers more flexibility and control over the retail pricing of the products. After 
receiving electronic information identifying a plurality of products and electronic 
product cost information, the customer price sensitivity, and logical relationships 
between gross profit margins and the customer price sensitivity, are determined for 
the products. The system electronically assigns varying margins to the products 

25 based on the logical relationships between margins and the customer price 
sensitivities. Retail prices for each of the products are then electronically generated, 
as determined by the cost information and the assigned margins for each of the 
corresponding products. In short, the system ranks products by their dollar costs and 
assigns smaller gross profit margins to products with higher costs because of higher 

30 price sensitivity of consumers to such products. This assignment is based on ready- 
made formulas that can be modified from expert knowledge. This invention can be 
seen as computerization of earlier manual optimization of variable price margins 
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based on expert assessments and comparisons with competitors' prices. The 
described invention differs considerably from our invention in which pricing and 
promotion are optimized in the framework of a constructed statistical model and 
estimated using historical database and data mining tools. 

5 US Patent 6,029,139 (Cunningham, et al. 2000) describes a system and method of 
evaluating and optimizing promotional plans for products, segments of products or 
categories of products. The promotion optimization system determines both the 
costsand the benefits of a proposed promotion plan for the sale of products. Using 
both costs and benefits, it proposes a promotional plan that will better meet the 

10 user's goals. This may involve new promotion plans or existing promotions scheduled 
at different times. Another option may be the coordination of promotions between 
two related segments of products, i.e. groups of products that may be promoted 
together. Each product has an associated sales history and manufacturer. Neural 
networks are used for processing the corresponding data structures. Sales objectives 

15 and constraints are applied to neural networks generating promotional plans for 
product segments. 

Since the cited patent application does not go into description of statistical 
techniques for processing historical data, it does not easily render itself to 
comparisons. Still, it is clear that it does not contain modern data-dependent means 

20 for assessing efficiency or inefficiency of promotional plans that may depend on huge 
number of parameters, and in particular it does not provide means for estimating 
efficiency of in-store monitors carrying promotion clips which is suggested in our own 
patent application. Its estimation structure appears fixed in that it does not provide 
means of integration of large numbers of various factors like different forms of 

25 promotion schedules, pricing plans, time-dependent structures, etc. into a common 
estimation system capable of sorting out partial influences of various factors on 
cumulative product demand. Furthermore, it is well known that neural networks are 
not useful for isolating influences of multiple factors operating simultaneously in 
different directions. At the same time, it can be seen from the description of our own 

30 work that isolating influences of different factors is extremely important for effective 
optimization of promotion and pricing effects simultaneously. 
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US Patent 6,076,071 (Freeny, Jr., 2000) describes an automated product pricing 
system including a physical store system, a virtual store system, and a control 
system. The physical and virtual store systems transmit sales data indicative of the 
number of sales of respective products. The control system receives the sales data 
5 from the physical store system and the virtual store system, and generates price 
change data including a changed price of an identified product based on the sales 
data received from at least one of the physical and virtual store systems. The price 
change data is then transmitted by the control system to at least one of the physical 
and virtual store systems to thereby change the price of the identified product. Thus, 

10 the system communicates advertising price change codes indicative of different 
advertised prices, i.e. it is capable of price optimization. On the other hand, although 
it handles advertising, it does not have means for assessing efficacy of different 
methods of advertising, neither means for sharing out a cumulative demand increase 
to various sources of advertising. By implication, it cannot estimate sources of 

15 demand variability and cannot simultaneously optimize pricing and advertising. 

US Patent 6,078,893 (Ouimet, et al., 2000). In this invention, the user selects a 
demand model and a market model. The market model describes how some of the 
parameters of the demand model behave according to external market information. 
The market model is derived by studying prior sets of sales histories and determining 

20 an empirical relationship between the sales histories and the parameters of the 
demand model. The user first selects a consumer demand model to be tuned to the 
sales data. Consumer demand models are known in the art, and in a preferred 
embodiment, the user will be provided with a database of predefined demand models 
from which to choose. The user will also be given the option of defining a new 

25 demand model that can be tailored to meet the user's specific needs. Next, the user 
selects a market model, which describes how some parameters of the demand model 
are expected to behave according to external market information. He will be given a 
number of options for selecting a market model. In addition, the user can also be 
provided with a database of predefined market models, each corresponding to a 

30 particular demand model, from which to choose. In a similar fashion, the user is 
given a number of other options for making important decisions. 



MAK-106US - 8 - 



This model is very complex and contains a lot of elements that have to be fixed by 
the user to enable the system to work. Consider, for instance, selection of a market 
model that functions as a penalty function for the demand function. Such a selection 
is by no means a simple matter for an average user. Moreover, its parameters are to 
5 be estimated before it could be used in the system for modifying the demand 
function. Erroneous selection or estimation of a market model will result in an 
erroneous combined model, thereby ruining the initial demand function instead of 
correcting it. By contrast, in our own invention this problem is solved by restricting 
potential prices to an priori selected neighborhood of the current price in the pricing 
10 space. Such an approach does not require from the user to make decisions that he 
may be unwilling or unable to make. 

US Patent 6,094,641 (Ouimet, et al., 2000). In this invention, the original demand 
model is modified to include a mechanism to convert actual prices into perceived 
prices, thus causing the demand model to predict higher demand for certain prices. 

15 The user specifies the function that converts from real prices to perceived prices. 
This modified demand function is then fitted to a sales history to yield the 
parameters appropriate to its particular form. Also, the demand model can be 
modified to account for promotional effects. The user defines a visibility model, which 
gives the relative increase in demand for an item caused by a promotion, and the 

20 cost of the promotion. The demand model is modified to include the effect of 
increased demand based on the visibility, and a profit model is modified to account 
for the added cost due to the added visibility. The profit model is then optimized with 
respect to both prices and promotions. Advertising is used in a general sense 
including newspaper ads, etc. Psychological effects include price thresholds, etc. 

25 Model selection refers, apparently, to a choice of the form of demand function. 
Although it ostensibly provides additional features to the user, it is hard to see how 
the user could make a meaningful choice between different forms of demand 
function. Even more problematic for the user may be the need for defining a visibility 
model that should give the relative increase in demand for an item caused by a 

30 promotion. Visibility model is suggested to be given by a table providing the relative 
increase in demand for an item at a given price vis-a-vis promotion. It is not clear 
from the text how this 'relative increase' is to be estimated. Model estimation called 
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"tuning process" in the patent, includes also optimization and uses simulated 
annealing without any reasoning for its appropriateness. 

US Patent 6,553,352 (Delurgio et al., 2000) describes a method for enabling a user 
to determine optimum prices of products on sale. The interface includes a 
5 scenario/results processor that enables the user to prescribe an optimization 
scenario, and that presents the optimum prices to the user. The optimum prices are 
determined to maximize a merchandising figure of merit such as revenue, profit, or 
sales volume. The optimum prices are determined by execution of the optimization 
scenario, where the optimum prices are determined based upon estimated product 

10 demand and calculated activity based costs. The patent contains extended 
arguments for optimizing product groups rather than individual products. Promotion 
optimization is dealt with in the co-pending patent application 20030110072. 
Detailed descriptions concern mainly interfaces, other parts are very short and do 
not present sufficient detail for implementing the methods as stated. In particular, 

15 the patent does not contain description of method for estimating target functions for 
optimization, nor methods of optimization, nor methods for estimating errors that 
contaminate the results. 

US Patent Application 20020099678 Al (Albright et al., 2002) describes a system 
and method for predicting and analyzing the consequences of a pricing or 

20 promotional action in a retail setting, and also for monitoring the actual result of 
marketing actions and communicating real-time or near-real-time information 
regarding the results. According to one aspect of the invention, a management tool 
links sales data and modeling algorithms to predict the results of pricing or 
promotion actions, thereby allowing a user to propose an action and view the 

25 predicted results. According to another aspect of the invention, the management tool 
monitors an implemented action and assesses the effect of the action on 
performance metrics. According to another aspect of the invention, users can select 
elements for a template for a web page displaying select company information, such 
as news, a scorecard showing pricing/promotion action results, the company's and/or 

30 its competitors' stock prices, current market capitalization and corporate PSP sales. 
Graphical user interfaces allow a user to easily interact with the underlying modeling 
applications to set a specified goal, to query the consequences of proposed actions 
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and to compare results from more than one potential action using selected 
performance metrics. 

Despite these attractive features, the modeling system as described in patent 
application 20020099678 contains no clear method of integrating pricing and 
5 promotion influences into cumulative influence onto product's demand. In particular, 
it does not contain a means of simultaneous balancing of pricing influences that 
apparently are very strong with promotion influences that may be unreliable and 
sometimes negligible and statistically insignificant. Neither, does it contain means for 
assessing significance of promotion influences and deciding whether they warrant 

10 considerable changes of promotion scheduling in particular ways and directions. 
Next, although the system contains means for manipulating Mf-then' scenarios and 
queried proposed actions, it does not allow for optimization of prices and promotions 
by searching for optimal courses of action in the pricing-promotion space. Thus, on a 
number of important points, the system falls short of achieving the targets set up for 

15 the system described in our invention. 

US Patent Application 20020123930 Al (Boyd et al., 2002) provides a promotion 
pricing system for producing and evaluating promotion pricing strategies. In 
particular, a user may evaluate historical data to determine a promotional strategy 
to accomplish various business goals, such as increasing total sales volumes or 

20 increasing sales in certain desired market segments. The promotion pricing system 
can either propose a promotional strategy or evaluate the expected effect of a 
promotional policy provided by the user. The promotion pricing system works by 
defining market by specifying the various products in the market, as well as the 
suppliers and consumers. The promotion pricing system then looks to historical 

25 market data to create a market model which may be used to determine various 
information, such as profit or sales maximizing sales conditions. 

The model is very general in that it is capable of using multiple sources of external 
information like data on market share of various products, data from competitors' 
transactions, etc. As always, however, generality comes at a price: much more of 
30 varied historical data are needed for reliable estimation of model's parameters and 
for testing for statistical effects. Also, optimization modules may face difficulties 
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when trying to find solutions in high dimensional spaces. In contrast, the model in 
our invention can work with a bare minimum of historical data, i. e., with scanner 
data available in any computerized business environment. In addition, our system is 
capable of direct estimation of promotion effects of various clip series run on in-store 
5 monitors, and of estimating their efficiency or lack thereof while creating and 
monitoring promotion schedules. 

US Patent Application 20030110072 (Delurgio et al., 2003). This patent application is 
related to co-pending application 6,553,352 and describes a method for enabling a 
user to determine optimum prices of products on sale. The interface includes a 

io scenario/ results processor that enables the user to prescribe an optimization 
scenario, and that presents the optimum prices to the user. The optimum prices are 
determined to maximize a merchandising figure of merit such as revenue, profit, or 
sales volume. The optimum prices are determined by execution of the optimization 
scenario, where the optimum prices are determined based upon estimated product 

15 demand and calculated activity based costs. Detailed descriptions concern mainly 
interfaces, other parts are very short and do not present sufficient detail for 
implementing the methods as stated. In particular, although promotion optimization 
is mentioned, neither method for estimating target functions for optimization, nor 
methods of optimization are given. 

20 Our own previous invention described in US Patent Application no. 20030220830 
Al: Method And System for Maximizing Sales Profits by Automatic Display Promotion 
Optimization, filed 4/4/2002. The system in this patent application evaluates among 
other things, influences of various in-store promotion displays in a supermarket by 
using locally-weighted straight-line smoothers applied to historical data on demands 

25 in a database. A real time iterative optimization algorithm is used for calculation of 
optimal clip schedules. These functions are also present in our new invention in 
which they make part of the "promotion part" of the model. At the same time, the 
new model, or better say its "promotion part" is much more sophisticated and has a 
number of new features: it is not locally linear, is suitable for dealing with groups of 

30 interconnected products rather than with individual products, is provided with a 
learning algorithm for evaluating comparative efficiency of different promotion 
schedules, and some others. 
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BRIEF DESCRIPTION OF THE INVENTION 

The present invention describes a method and system for automatic maximization of 
a preferred merchandising figure of merit like revenue, profit, etc. in a supermarket 
or in a chain of supermarkets. It enables the user to determine optimum prices and 
5 in-store promotion schedules for groups of related products based on predicted 
product demands. Demand predictions are produced by statistical prediction 
algorithms coupled with data mining procedures applied to historical database that 
contains sales data together with various sales conditions. Determination of optimal 
prices is initiated by the user on the per request basis by execution of optimization 
10 scenarios, while optimal promotion schedules are generated continuously in real time 
after initial options have been set-up. 

Optimization scenarios are selected by the user from the library of optimization 
templates parameterized by input parameters and various options on the user's 
menu. The user's interface includes a scenario menu that enables him to select 

15 optimization scenarios or prediction scenarios including various input parameters, a 
means for interaction with the computation engine, and a report generator that 
presents optimum prices or predicted prices and accompanying relevant results in 
various formats. The system combines a rich store of optimization and prediction 
scenarios with real time information processing in which most recent data are 

20 merged into historical data acquired previously via interface with the system 
database. 

A crude map of the system's functioning is presented in Fig. 1. At a preliminary 
stage, the system performs data-mining of the existing historical database (Block 
101): it accesses the historical database (Block 102) and applies data-reading and 

25 data-mining procedures that setup a secondary working database containing data 
relevant for regression modeling (Block 103). Later this secondary database will be 
updated and used continuously instead of the original database. The system's 
timeframe consists of two major periods: Initial Period and Main Period. It is 
assumed that at the Initial Period (Block 104), there are available sales data like 

30 scanner data for some previous time period, say, one or two years. Those data 
should include common sales conditions such as days of the week, holidays, etc., 
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and basic sales information items like quantities, brands, prices, pointers for items 
being put on sale at clearance prices, etc. On the other hand, it is not assumed that 
detailed data on promotional activities have been stored in the database. Therefore, 
while prices could be optimized during that period, promotion schedules would be 
5 generated by the Initial Promotion Scheduling Procedure that does not require 
historical data on promotion activities. Subsequently, when sufficient sales data have 
been collected for the promotion optimization module to function properly, the 
potentially infinite Main Period starts (Block 106). 

At all stages, the user can communicate with the system via User's Interface Module 
io (Block 105) by selecting optimization or prediction scenarios and input parameters, 
and receiving customized reports with optimized prices, sales predictions and other 
relevant information of various degrees of detail (Block 107). 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1: Joint Pricing and Promotion System: Overview 

15 Fig. 2: Joint Pricing and Promotion System: Major Computational Steps 

Fig. 3: Joint Pricing and Promotion System: Major Components 

Fig. 4: System Database 

Fig. 5: Pricing: Regression Model Construction 

Fig. 6: Pricing: Testing for Batch Set Admissibility and Regressor Relevancy 
20 Fig. 7: Pricing: Regression Modeling and Objective Function Construction 
Fig. 8: Pricing: Group Significance Testing 
Fig. 9: Promotion Scheduling Procedure 

Fig. 10: Pricing-Promotion Estimation-Optimization Loop: Initial Period 



Fig. 11: Pricing-Promotion Estimation-Optimization Loop: Main Period 
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Fig. 12: User-System Interaction via Price Optimization Scenarios 

Fig. 13: User-System Interaction via Price Prediction Scenarios 

Fig. 14: User-System Interaction via Promotion Scheduling Scenarios 

DETAILED DESCRIPTION OF THE INVENTION 

5 This invention relates to the field of econometrics, and more particularly to a system 
and method for determining optimum prices for a set of products within a product 
category in a store, where the optimum prices are defined as the prices that 
maximize a merchandising figure of merit such as revenue, profit, or sales volume. 
Along with optimization of prices, this invention provides a means for concurrent 
io demonstration of promotion clips on in-store monitors in such a way that joint 
influence of pricing and promotion is optimized. 

For achieving these goals, the present invention provides flexible techniques for 
configuring optimization scenarios from the user interface, determining a set of 
optimum prices corresponding to the scenarios, and concurrent optimal clip 
15 demonstration scenarios, and displaying those optimum prices in a user-friendly 
form together with other relevant information of potential interest to the user. 

We begin with an example. 

Example: Clearance Pricing Optimization Problem 

Suppose it is decided to sell out a number of product items in a supermarket within a 
20 particular target time period called clearance period, and to maximize the total 
revenue during that period by setting up optimal prices (for a theory for clearance 
sales, see, for instance Gupta, Hill, and Bouzdine-Chameeva, 2002). Along with 
adjusting prices, it is possible to use other means for enhancing sales like mobilizing 
in-store advertising means. 

25 It is assumed that historical statistical data like scanner data are available on sales of 
those items for past periods. Besides sales volumes, those data are presumed to 
include the corresponding price levels, some concurrent external conditions that 
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could affect sales volumes, for instance, time characteristics like season, day of the 
week, holidays, and also such factors as special discounts, advertising activities, etc. 

A simpler version of this example will be selling a single product whose revenue we 
wish to maximize during a prespecified clearance period, say, one week. The product 
5 will be on sale for one week at a markdown price after which the leftover stock is 
sold out at a very low price (salvage price) or returned for a low compensation. 

First we give a general approach to solving suchlike problems in pricing optimization 
that includes data mining, regression model construction, regression fitting and 
estimation, objective function construction, pricing optimization, and error 

10 estimation. Then we describe a statistical-optimization system that uses 
measurements of in-store promotional activities stored in the database for the goal 
of optimal allocation of clip demonstration times at in-store TV monitors. After that 
we present an integrated statistical-optimization system that includes both pricing 
and promotion optimization subsystems and utilizes them for revenue optimization 

15 from sales of a category of products. Finally, we describe the user interface module 
that allows the user to select desirable optimization or prediction scenarios, feed 
input parameters and options into the system, and to receive customized reports 
with optimal prices, predicted values and other information. 

Assume that the product's weekly revenue R is a known function of its price p and 
20 weekly demand y : 



Assuming that the product's demand is stochastic but can be adequately modeled by 
a price-demand function known up to parameter values, we can construct a 
regression model relating the weekly demand to product's price and a number of 
25 other relevant factors. Using historical sales data, we can estimate model 
parameters, and obtain a demand prediction function 



R = R(p,y) 



(l) 



y = y(p>g) 



(2) 
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where g is the vector of other relevant factors. Substituting equation (2) into (1) 

and suppressing the dependence on the known vector of other factors, we can write 
the revenue as a function of price alone: 

R = R(P) O) 

5 Direct maximization of revenue R in (3) is not possible since it is random; instead, 
we will work with its mean value 

R=R(p) (4) 
Maximizing R in p will produce an optimal expected revenue 

R opt =mzx(R(p)) 

10 and an optimal price to be charged for the product: 

p opt =argmax(£(/?)) 

The main purpose of the invented system is maximization of a preferred 
merchandising figure of merit like revenue, profit, etc. Mathematically, it is 
optimization of a selected objective function. For simplicity we will be considering 
15 maximization of revenue since other figures of merit may be dealt with quite 
similarly. If, say, maximization of gross profit G is desired, we have first to know 
the functions that express gross profits as a function of price and demand for each 
product: G = G(p,y) (similar to R in formula (1)). Then we can substitute the 

demand functions in (2) into those functions and perform optimizations as before for 
20 revenue in (3)-(4). Of course, numerical results, i.e. the optimal prices may be 
different. We consider the cases: single product vs. product group, regular price vs. 
clearance price, and simple period vs. composite period. 

Pricing Optimization 



MAK-106US - 17 



Fig 2 presents six major steps of the "pricing component" in the statistical- 
optimization system: 

Step 1: Data Mining 

Step 2: Regression Model Construction 

5 Step 3: Regression Fitting and Estimation 

Step 4: Objective Function Selection (Construction) 

Step 5: Optimization 

Step 6: Error Estimation and Significance Testing 

We will briefly review these steps using the single product example above. We will 
10 start with Step 2 postponing discussion of the preliminary Step 1. For constructing a 
useful regression model, we need to set up a regression design matrix (X matrix), a 
response vector {Y column), and a noise covariance matrix. The X matrix 
represents the sales conditions, or business influences, the Y column provides the 
weekly demands, i. e. sales volumes, and the noise covariance matrix reflects 
15 demand uncertainty. 

The rows of the design matrix represent weekly sales conditions. To capture effects 
of those sales conditions, a number of regressor variables, or factors, representing 
them will appear as matrix's columns. Only relevant factors qualify for inclusion, i. e. 
the factors that could help in predicting the response and whose values can be 
20 uniquely determined for each week. Moreover, each of those factors has to have 
been observed on a number of different levels. 

It is well known in econometrics that log-demand (logarithm of demand) should be 
used as the regression response, rather than demand: it describes demand better, 
helps to avoid a problem of negative demand values, and results in better properties 
25 of estimates. Therefore, we will be using the following regression model: 



log7 = Xp + np + s 



MAK-106US 



- 18- 



Here X is the matrix of the variables X ]9 X 2y ...X k representing concurrent sales 

conditions that could affect sales volumes like season, day of the week, holidays, 
product brands, various discounts, promotion activities measured on some scale, etc. 
Some of those sales conditions may be qualitative, i.e. e. categorical, while others 
5 quantitative. A particularly important variable is product's price p . The noise 
covariance matrix is assumed to have the structure 

cov(s) = a 2 diag{\ I w x ,...,1 / w n ) (5) 

where the weights w i are presumed to have been estimated prior to fitting the 
regression. 

10 At Step 3, the regression model is fitted by a WLS procedure, and we obtain a linear 
predictor for the log-demand in the form 

logj> = x r /? + pn 

At Step 4, an appropriate objective function is selected based on particular 
circumstances and on user's preferences communicated to the system via user's 
15 interface. Possible forms of objective functions are described in section Objective 
Function Construction below. 

Proceeding to Step 5, we assume that the values of sales conditions for the target 
period are known or have been estimated. Then as seen from equation (4), the 
product's predicted mean revenue can be expressed as a function of price alone, and 
20 after performing optimization we obtain optimal values for revenue and price. 

After optimal revenue and optimal price have been computed, error estimation and 
criticism of results (significance testing) are performed at Step 6. Since all obtained 
results are based on historical data, they are uncertain. Therefore, they have to be 
tested for significance, to make sure that the proposed price adjustments are not 
25 resulting exclusively from noise in data. The corresponding error estimation and 
hypothesis testing are described in section Pricing Optimization below. 
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Returning to Step 1, the data mining procedures scan the database for records on 
sales and on various attributes that may be used for representing sales conditions. 
They select data items from the records and arrange them in batches by weekly (or 
other) periods. While performing these tasks, data mining procedures use predefined 
5 patterns for constructing relevant factors to be included into regression model. These 
factors will be constructed by special procedures by converting attributes and 
measurements available in the database. Selection of relevant, i.e. available and 
useful, regressors depends on the target period on which the factors are going to be 
available, and on availability of those factors in the historical records. 

10 There are a number of ways in which attributes available as field values in the 
database can be used in model construction. First, some attributes may be directly 
included as values of regressors into the design matrix. Second, dates in the 
database records linked to a calendar will enable to construct additional regressors 
such as day of the week, season, pre-holiday, etc. Third, some decisions on 

15 regression variables may be made based on the information available on the sales 
conditions during the target period. Finally, some additional data preprocessing, 
structuring, aggregating and condensing of data may also be performed at this 
stage. 

Regular Pricing 

20 Besides clearance price optimization, we have a problem of regular price 
optimization. Some of the differences as compared to clearance price are abundant 
inventory, and undetermined (unlimited) target period. Though no explicit target 
period is defined in clearance price optimization, we have to have a target period for 
the purposes of structuring historical sales data to be included into similar batches. 

25 Since sales activities are evolving around business week cycle, it is natural to define 
target period as one regular week. Regular week means a full business week with no 
holidays, pre-ho!idays, post-holidays, or any other features that might significantly 
influence sales activities. The part played by the target period in regular price 
optimization is limited: essentially, it is no more than a framework for extracting 

30 relevant data from the database. 

Category Pricing Optimization 
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In reality, products come in categories rather than in isolation: an oat meal belongs 
into category cereal, cheddar belongs to category cheese (to a subcategory fat 
cheese), etc. Products in a common category tend to exhibit substitute demand, so 
that changes in sales of one product may directly affect sales of other products in its 

5 category. On the other hand, inter-category substitution is relatively low. As a result, 
it would be highly unrealistic to try to maximize revenue by adjusting prices of 
isolated products as described above. If the price of one product in a category is 
reduced, and as a result its sales and the corresponding revenue increase, it will 
probably also imply that sales of substitute products will go down which will make 

10 the category total revenue unpredictable. A similar but somehow inverse effect takes 
place for products with complementary demand, for example, gin and tonic, or 
alcohol and snack food. To correct for these effects, it appears necessary to consider 
revenue optimization of groups of substitute or complement products rather than 
single products. In so doing, we will be taking into account the effects of price 

15 adjustments of each product on the group as a whole, i. e. the changes in demand 
for all products in the group resulting from pricing changes in one or more products. 
The subsequent revenue optimization will also be done over all products in the 
group. The product category or a subcategory of manageable size appears a natural 
candidate for a group. 

20 Earlier we mentioned two kinds of factors that could affect product demands: factors 
outside of our control like days of the week, and those like prices that are completely 
controlled by us and directly affect sales. However, there are additional factors like 
in-store promotional activities that are also controlled by us and can affect sales, 
though perhaps less directly and to a lesser degree than pricing adjustments. Now 

25 we will sketch a second model that will allow us to bring in a formal representation of 
promotion influences and to optimize them by optimizing in-store promotion 
scheduling. For the time being, we are leaving out the pricing optimization and are 
concentrating on promotion activities. 

Promotion Scheduling Optimization 

30 Assume that it is decided to promote a number of product items in a supermarket by 
running relevant commercial clips on a number of TV monitors stationed in various 
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isles. At any given time period a TV monitor can carry a certain amount of clip 
demonstrations on a selected group of items, probably more than one demonstration 
of the same clip. Assuming that in general clip presentations tend to increase sales 
(though obviously to a limited degree), it is desired to develop clip presentation 
5 schedules for a set of in-store monitors that optimize promotion influence and 
thereby maximize overall revenue from sales of the promoted products. 

Let the products that are going to be promoted be numbered d = 1,2,. ..,£>. Then if in 

a given time period, a quantity y d of product d is sold at a price p d (price can be 

replaced by profit margin if desired), he overall revenue from sales of all products in 
10 the group is 



In order to be able to maximize the revenue by optimizing promotion scheduling, we 
have to model influence of promotion schedules on product demands. This will be 
done by setting-up the following regression model. 

15 Let X = X dj be the scheduling matrix giving the time devoted to promoting product 
d within a given time period on a monitor j , j = \,...,J , y dj be the sales volume of 
product d that resulted from its promoting on monitor j, and f dj (.) the 
corresponding influence function: 



20 Since the cumulative sales of product d that resulted from all demonstrations on all 
monitors can be expressed as 



D 




(6) 



y dJ = fdj(x<y) 



(7) 



j 



yd =Z^- 



(8) 



we can substitute (7) and (8) into (6) and get 
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d=\ y-i 

The influence functions f dj {) can be estimated by locally weighted regression 
methods, and then by going from random revenue in (6) to expected mean 
revenue/? , we write it as function of the scheduling matrix X : 

5 R = R(X) (9) 

Optimization of mean revenue in (9) can be carried out by mathematical 
programming techniques under appropriate restrictions like nonnegativity of 
demonstration times, physical restrictions on demonstration times on each monitor, 
etc. An obtained optimal solution X dj for all d and j will give us an approximation 

10 to the "true" optimal schedule. 

Now we will outline a more comprehensive model that will allow to bring both pricing 
and promotion optimization systems into a common framework, and will enable to 
combine the beneficial effects of both. 

Joint Pricing-Promotion Model 

15 A generalization of the two models above is a comprehensive joint multivariate 
regression model having the formal structure 

U = GT + XB + PTl + E (10) 

where U is the vector of log-demands for a given category of products, G is the 
common factors design matrix (containing the common sales conditions), T is the 
20 common factors regression coefficient matrix (or common effects matrix), X is the 
individual factors design matrix (containing promotion effects), B is the individual 
factors regression coefficient matrix (individual effects matrix), P is the price design 
matrix (matrix of own and cross price effects) whose rows are category price vectors 
at past periods, n is the price regression coefficient matrix, E is the noise matrix. 
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For optimization of pricing as well as of promotion scheduling we need to construct 
predictions of demand by fitting equations (10). It will be done in a stepwise manner 
in a real time loop, say, daily, using the historical sales data up to the current day. It 
will be assumed that at the very start, there are available sales data for some 
5 previous period, say, one year. The sales data contain the sales conditions data that 
fill the matrices G , X and P , as well as sale quantities U . After a latest fit of the 
equations (10), we obtain estimates of Y , B and II, and can construct a linear 
predictor for the category log-demand vector u in the form 



io where the vectors g, x and p provide common sales conditions, promotion 

conditions and category prices respectively for a target period for which prediction is 
being sought. From (11) we can express the category demand vector j; as a 

function of promotion schedules x and price vector p : 



15 Now we can optimize category revenue in (6) in x and p under appropriate 
restrictions and obtain optimal promotion schedules and prices for the target period. 

Pricing and promotion estimation-optimization steps can be performed consecutively 
rather than simultaneously which means that at pricing step the promotion-related 
quantities are presumed known and do not change; conversely, at promotion step all 
20 price-related quantities are considered constant. Technically, at the pricing step, the 
fitted equations are 



u =gt + xB + pfl 



(U) 



y = f(x,p) 



(12) 



with 



V = U-XB 



(13) 



25 where B have already been estimated and therefore are assumed known. At 
promotion step, the fitted equations are 
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z = xp 



(14) 



with 



z = u-Gr-m 



(15) 



where f and n have already been estimated. 



5 This approach is shown schematically in the flow-chart in Fig. 3 and described in 
detail in section Pricing-Promotion Estimation-Optimization Loop below. Block 301 
shows the system database containing historical data and current (latest) data. The 
data obtained from data mining in Block 302 is fed into the computational engine in 
Block 303. The computational engine receives instructions and parameters from the 
10 user in Block 305 and provides (modified) instructions and parameters for the 
computational module in Block 304 containing the main system model and the 
computational loop. It produces promotion schedules (Block 306), pricing schedules 
(Block 307), and a new batch of data for the secondary database in Block 308. After 
the loop is completed, control returns to the computational engine in Block 303. 

15 This approach is preferred for the following reasons: 

1. It can be assumed that at the very start, there are available sales data for 
some previous period, say, one or two years. On the other hand, it is not assumed 
that data on promotional activities have been stored in the database. So, while 
equations (12)- (13) can be fitted, equations (14)- (15) cannot, at least, for the time 

20 being. Therefore, it appears natural to introduce an initial period during which prices 
could be estimated and optimized while promotion schedules would be generated by 
an Initial Promotion Scheduling Procedure. 

2. Different types of regression may be used for estimation in pricing and in 
promotion scheduling: linear for log-demand in pricing, and locally weighted in 

25 scheduling. Either of them can be modified or extended in various ways if desired 
without affecting the structure of the other. 
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3. Time periods and the corresponding matrix rows have natural ordering in 
pricing optimization. There is always one current price and any price changes can be 
only affected relative to it. In contrast, the time ordering of a sequence of promotion 
schedules does not seem to carry much weight. Neither does the current schedule. 

5 Next schedule construction may start from any recorded schedule, and as a result 
attempts at schedule optimization may cause large jumps in the scheduling space. 

4. Effects of price changes on sales appear to be much stronger and more 
unambiguous than those of promotion displays. Therefore, it seems necessary to 
provide safeguards against spurious effects of promotion scheduling. For preventing 

10 false moves in the schedule space, we introduced two modifications: First, more 
promising existing schedules are selected as candidates for starting points in 
construction of the next schedule, and second, statistical hypothesis testing is done 
for checking significance of an expected revenue increase. If significant, the 
corresponding promotion schedule as computed by the optimization leg is suggested 

15 as the next schedule; otherwise, a new schedule in the vicinity of the existing 
schedules is constructed with the sole purpose of increasing the density of the data 
points in the exploration region and giving a better chance of obtaining significant 
revenue increase at a later period. 

Contents of Database 

20 The use of historical sales records stored in a database is crucial for constructing a 
regression model, with subsequent estimation of parameters, and calculating 
expected revenues, expected demands, and other quantities of interest. It is 
presumed that the available historical data contain at least scanner data and 
probably some extra data. Scanner data are electronic records of transactions that 

25 are collected as part of business operation. The most familiar and now ubiquitous 
form of scanner data is the scanning of bar codes at checkout lines of retail stores. 

The historical data are stored in a file in some standard format and consist of sales 
records organized by dates and containing a number of attributes. Alternatively, they 
may be stored in a set of files in different formats, but then the control system 
30 should be able to query those files simultaneously, to merge records, etc. We 
consider first the case of a single product item, and later a group of items. We can 
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visualize (Fig. 4) the system database (Block 401) as a table containing the following 
groups of attributes: Sales conditions (Block 402), promotion data (Block 403), and 
sales data (Block 404). Sales conditions contain such relevant attributes as SKUs, 
prices, time, day, weekday type, product brands, discounts: brand, quantity, 

5 package, pre-holiday discounts, etc. (SKU is Stock Keeping Unit, i. e. a number 
associated with a product for inventory purposes or, alternatively, relating the 
product to any arbitrary numbering system capable of uniquely identifying it.) 
Promotion (advertising) data include SKU, product group, product category, TV 
screen location, time of day, and frequency of clip demonstrations. Sales data 

10 include SKU, sales volume, and time of day. The secondary database containing the 
required data in a convenient form is shown in Block 405. 

Structure of Regression Models, and Data Mining 

Historical data on sales stored in a database is used for constructing regression 
models. There are two major components to regression models: relevant factors, or 
15 regressors, and data batches that define rows of regression matrices. Both these 
components are constructed by data mining procedures that extract data from 
databases and arrange them into predefined patterns using statistical and logical 
criteria and also built-in expert knowledge. 

A regression design matrix is constructed based on the records in the table, i. e. 

20 database, and on any relevant additional info available. First, some of the above 
attributes may be included as values of regressors into the design matrix. Second, 
dates in the table linked to a calendar will enable us to construct additional 
regressors such as day of the week, season, pre-holiday, etc. Third, some decisions 
on variables are based on the information available on the sales conditions during 

25 the target period, i. e. the period for which the prediction and planning are going to 
be made. 

Assume that pricing optimization has been planned for one week. This means that a 
target period has been set at one-week period and therefore all expected quantities 
of interest have to be calculated for one week ahead. Then the historical data will be 
30 broken into weekly periods, and weekly data batches will be used in regression 
models. The response column will be logarithms of cumulative weekly sales. 
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For constructing the design matrix, it should be decided what regressors (or 
variables, or factors) it should contain. Selection of relevant, i.e. available and 
useful, regressors depends on availability of those factors both in the historical data 
and in the target period. Regression factors may be classified into the following 
5 groups: general factors like day of the week, or pre-holiday day, prices, promotion 
activities quantified as necessary, and other factors like product brand, quantity 
discounts, etc. Promotion activities in our sense are clip demonstrations on in-store 
monitors and do not include any other forms of promotion, so that quantity discounts 
and other factors which might be viewed as promotion are grouped separately. 

Some of the regressors are quantitative, others qualitative (or categorical), and 
some may be both. Besides obviously quantitative variables like price, the regression 
model will contain additional variables for representing influences originating from 
package discounts, special pre-holiday discounts, advertising activities, differences 
between days of the week, seasonal influences, etc. Price is among the most 
important quantitative regressors. Days of the week, seasons, pre-holidays, package 
discounts are qualitative and thus representable by categorical variables. Quantity 
discounts can be quantitative, qualitative, or both. Promotional activities, their 
structure, characteristics, localization and intensity may be modeled by a 
combination of categorical and quantitative variables. 

20 Relevant Factors 

A factor is called relevant if 

1. It is available in the historical database; 

2. It is available within the target period; 

3. Its value is known by the beginning of the target period and does not 
25 change during it. 

In our one-week example, we can include season and pre-holidays among relevant 
factors and also factors like product brands, quantity discounts, package discounts, 
special pre-holiday discounts, advertising activities if the values of those factors are 
going to be fixed during the target period. Then the regression design matrix will 



MAK-106US 



-28- 



have rows corresponding to past weeks, and the following regressors: Season, 
holiday, quantity discounts, package discounts, pre-holiday discounts, advertising, 
and price. 

Besides regressors that are always used in the regression, there may be others 
5 whose presence is determined by various circumstances such as measurement 
availability, ease of computation, or user's assumptions, for example, regressors that 
model time explicitly. Such regressors are added at the model construction step. 

Time influence can be modeled in a number of ways. Under assumption that there is 
no discernible time trend, i. e. sales volumes and conditions do not change much 

10 within the overall time period, no direct modeling of time appears necessary. On the 
other hand, if time trend is known to exist or is suspected, time may be modeled as 
a discrete quantitative variable numbering, say, weeks, or any other preferred time 
periods. Treating sales histories showing seasonal influences will require introducing 
polynomials in time into the regression. Alternatively, rather than introduce time 

15 factor explicitly, it is possible to use generalized, or weighted, least squares for 
estimation where smaller weights are given to older records. 

Similar Time Periods, Similar Batches, Admissible Batches 

Similar time periods will be defined by their similarity to the target sales period, and 
will have duration of that period. Sales activities like many other business activities 

20 are evolving around weekly business cycle. This basic weekly cycle is modified by 
other factors like holidays, seasonal influences, etc. Sometimes, however, one may 
be better off by defining similarity for shorter periods like, say, separate days of 
week, or even parts of the workday. On the other hand, while handling the data on 
Christmas sales, say, one may want to have similarity defined by a modulo one year. 

25 In general, definition of similarity for time periods should be based on a priori 
information, expert knowledge of subject matter, and also on relevant statistical 
considerations. 

Subsets of sales records associated with a given time period are called batches. 
'Associated 1 here means that the corresponding sales took place within time 
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boundaries of the given period. The batches associated with a set of similar periods 
are called similar batches. 

Before estimating a model it is desirable to check if the quantity and quality of the 
available data are satisfactory, otherwise, the model cannot be used for prediction. A 
5 set of data batches is admissible, if they provide sufficiently good data for estimation 
and prediction. The corresponding verification will be done by an Algorithm for Batch 
Set Admissibility and Regressor Relevancy described below. 

Simple Target Periods versus Composite Target Periods 

Sometimes breaking down historical data into time periods similar to the target 
10 period is not feasible or is not beneficial from the statistical point of view. In such 
cases we will conceive of a given target period as a composite period consisting of a 
number of subperiods. 

Consider an example. The selected target period is one week and we have historical 
data on 10 weeks but the prices used to be changed in the middle of the week, say, 
15 on Wednesday morning. Then operating in terms of the one week time period we 
cannot use the historical data at all since no price value can be attached to any past 
single week period. 

These difficulties arise because the requirement to be always able to break down 
historical data to periods similar to the target period is too rigid. Therefore, it would 

20 be beneficial to conceive of the target period as consisting of, say, three one-day 
periods. Then we could use all the data available on Mondays for estimating 'Monday 
parameters', and all the data available on Tuesdays, Wednesdays and Thursdays for 
estimating 'regular day parameters'. Thus, the composite target period approach first 
partitions the target period, then performs data mining and regression modeling and 

25 estimation separately for those subperiods, and then constructs the overall revenue 
function for the whole target period and maximize it. 

Modeling of Promotional Activities 

Assume that product items are promoted by running commercial clips on a number 
of TV monitors stationed in various isles. At any given time period a TV monitor can 
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carry a certain amount of clip demonstrations on a selected group of items, usually 
more than one demonstration of the same clip. The time dedicated to promotion of a 
given product at a particular monitor within a given time period could be used as a 
quantitative regression variable. For a group of monitors, this would produce a 
5 multiple regression model with locally weighted regressions as influence functions. 

Since influence functions in promotion scheduling may be non-stationary, e. g. vary 
considerably from hour to hour and possibly among days of the week within the 
same hours, it may be helpful to structure the time grid for clip demonstrations 
accordingly. Thus, we can view the week as a recurrent time frame consisting of a 
10 fixed series of different days of the week or daytypes. Furthermore, we may partition 
a working day into, say, four supposedly homogeneous time blocks: from the 
opening hour to 9 a.m., from 9 a.m. to 4 p.m., from 4 p.m. to 7 p.m., and from 7 
p.m. to the closing. Then promotion influence functions can be assumed stationary 
inside each time block. 

15 Setting-Up Secondary Database 

Mining the whole available historical database is a time-consuming and expensive 
process. After initial data mining of the historical master database, it may be 
advantageous to set-up a secondary working database that contains only the data 
necessary for estimation-optimization computations. Such data will be stored in the 
20 form most suitable for reading and processing, and moreover, the newly obtained 
current data will be immediately converted into this form obviating the need to 
extract them from the master database. 

Pricing Regression Modeling 

Optimization of revenue requires construction and fitting of regression models, 
25 estimation of all relevant parameters and making predictions based on them. Here 
we consider construction and fitting of those parts of joint regression models that 
relate demand to prices. For specifying a regression model, one needs to provide a 
form for regression function, regression data batches (corresponding to rows of 
regression matrix), regression factors (corresponding to columns of regression 
30 matrix), and assumptions on covariance matrix (noise structure). For any product on 
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sale, the form of regression function is given by a price-demand function known up 
to parameter values. We consider regression modeling of individual products first, 
and then the changes needed for modeling a group of interrelated products. 

Regression Model Construction for Individual Products 

5 As explained previously, the regression model contains rows (jc f/ u n w,.), i =!,...,« # 
corresponding to all similar batches (with log-demands w / =log( t y / )) where x i 
describes sales conditions, y.. sales volumes, and w i the weights associated with 
corresponding periods. For a similar batch containing k records with sales volumes 

y^y 2 ^y k ( 16 ) 

10 its sale volume is the sum of the sales volumes in all k records in (16): 

k 

Regression model construction is shown in Fig. 5. Input to the procedure contains a 
list of data items, parameters, default values, and user-selected items of which 
major ones are shown in Block 501. Based on this input information and built-in 

15 procedures, additional regression factors are constructed in Block 502. A set of 
similar batches is constructed from contents of the database based on selected 
parameters, and user-supplied items and instructions in Block 503. Decision to 
proceed with simple period or composite period is made in Block 504, batch set 
admissibility and regressor relevance are tested in Block 505, and finally the noise 

20 covariance matrix is estimated in Block 506. An output of the procedure that is the 
constructed regression model is shown in Block 507. 

Testing for Batch Set Admissibility and Regressor Relevancy indicated in Block 505 is 
shown in Fig. 6: 

1. Input: Set of similar batches, List of candidate relevant factors; 
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2. For all batches, check if the product was available for sale during the 
corresponding period; if yes, mark that batch as 'admissible'; if not, as 
'inadmissible'; 

3. Check if all the selected relevant factors are defined and have good 
values for all records in the similar batches. If some relevant factors 
are undefined, missing, have outlying values, etc. for some records, 
label them accordingly and apply methods for dealing with those 
problems in the X matrix if the problems are not too severe; if they 
are, discard corresponding records or the factors themselves; 

4. Check if each relevant factor has at least two measured levels among 
the similar batches; if not, label it 'irrelevant' and discard it; 

5. Check if no selected relevant factor has two or more different values in 
a single batch; if it happens, exit with a corresponding error message; 

6. If the price factor does not have at least two levels, declare the batch 
inadmissible; 

7. Check if there are enough batches for fitting regression. A simple 
formula may be adopted for defining the minimal number of batches, 
say, 2k + 3, where k is the number of regression parameters to be 
estimated. So, if n < 2k + 3 , declare the batch set inadmissible; 

8. Output: Logical output: admissible/inadmissible batch set, Error 
message describing reasons for inadmissibility, Set of similar batches 
(may be a subset of the initial set on the Input), List of relevant 
factors (may be a subset of the initial list on the Input). 

Estimation of the Covariance Matrix 

Our problem does not support the assumption of homoscedasticity of variance since 
if days of the week are represented by regression indicators, then different sales 
volumes in, say, Mondays and regular days will influence the corresponding 
variances. On the other hand, if separate regressions are fitted for various daytypes, 
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widely different sales volumes resulting, for instance, from a heavy promotion 
campaign may invalidate homoscedasticity assumption. Therefore, we will be 
assuming the covariance matrix structure in formula (5) where the weights are 
presumed to have been estimated prior to fitting the regression from formula 

5 w~l/(coeff _var) 2 (17) 

where 

( coe ff _var) 2 = var(t/) l(mean(u)) 2 (18) 

Now we have to estimate mean{u) = mean(log(y)) and var(w) = var(logOO) . 
Assuming that single sales volumes are randomly identically distributed with mean 
10 //, and variance S* , a batch sales volume is normally distributed as y ~ N(/u,S 2 ) 
by the Central Limit Theorem with parameters 

// = £//, and S 2 =kS } 2 (19) 

We will use robust estimators for mean /u x and variance S } of the form 

fa = med(y ]9 ...,y k ) (20) 

15 and 

^ 1 =(/nerf(|j; ; -// I |))/O- , (0.75) (21) 

where med is median, and O" 1 is the inverse of the standard normal distribution 
function. 

Algorithm for Estimation of Variance Weights: 
20 1. Compute estimates for and 5* by robust formulas (20) and (21); 

2. Compute estimates for ju and S 2 by formulas (19); 
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3. Compute mean(u) and var(u) by a numerical integration routine; 

4. Compute weightsw by formulas (17)-(18). 

Regression Model Construction for Groups of Interrelated Products 

As indicated above, products come in categories with substitute and complementary 
5 demand, and an efficient modeling has to take it into account. Therefore, we will 
consider revenue optimization of groups of substitute or complement products rather 
than single products. In so doing, we will be taking into account the effects of price 
adjustments of each product on the group as a whole, i. e. the changes in demand 
for all products in the group resulting from pricing changes in one or more products. 
io It is implemented by regressing the demand of each product on the category price 
vector and then optimizing the category total revenue. Of course, the number of 
regressors in the regression model increases, and optimization must be performed in 
a multidimensional space of category price vectors. Thus, category pricing modeling 
and optimization are much more difficult than single product pricing optimization. 

15 Using the properties of the least squares we can rewrite the multivariate regression 
model (10) as a system of D univariate regression models each corresponding to a 
single item d , for d = : 

U d =Gy d +X d fi d +Px d +e d (22) 

where P is an nxD matrix modeling the effects of a product's own price as well as 
20 the category's cross-prices the rows of which have the form 

1/(1 + 1/0 + /V.) P* W + P M ) .» 1/0 + Pd) 

and 7t d is the corresponding Dx\ vector of regression price coefficients. 
Regression Fitting and Estimation 

The constructed regression models for all products in the selected group are fitted by 
25 restricted weighted least squares. 'Restricted' since the price sensitivity should be 
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always positive, and 'weighted' as the covariance matrix of noise is not scalar. 
Although the least squares procedure is not robust to outliers, the resulting overall 
procedure is robust due to robust batch estimators in formulas (20) and (21). 

For computing a predicted revenue function for the target period, we have to 

5 estimate the variance <j t 2 for that period, which is equivalent to estimating an 

unknown variance weight w T at a "new" point. This in itself is a prediction problem 

and can be solved by one of standard methods, such as the nearest neighbors 
method. 

Regression Variable Selection via Cross Model Validation (CMV) 

10 The meaning of the common price design matrix is that the sales of any item in its 
category may be affected by all cross-prices in that category. These intracategory 
relationships enter via explicit dependency of each item's sales on the category price 
vector rather than on its own price alone. Though a priori the category price matrix 
will contain all cross-prices, it may happen that in fact only a small subset of prices 

15 really influences a given product's sale, the remaining category prices being 
superfluous. Those extra regressors will inflate predictor variances thereby causing 
deterioration in statistical properties of an estimated model and as a consequence in 
the properties of an optimized solution. To deal with this problem we apply a Cross 
Model Validation approach to variable selection described by Hjorth (1994). Variable 

20 selection will be applied in our model only to cross-prices, own price and other 
primary regression variables always staying in the model. 

Assume that we have regressors /? = 1,...,P, and an estimation set ES containing 
observations We distinguish between regressor subsets S and models m. If 

we have an estimation procedure, e.g. OLS, or SVD (modified OLS), we obtain a 
25 model m from a regressor subset S by applying this estimation procedure to S . If a 
subset S has p regressors, we say that it has size p and write #S = p. Often while 
searching through all subsets of a given size we will have to enumerate them in 
some way: S Np where N p =#{S :#S = p) is the number of subsets of size p . 

Each model m produces prediction y for any data point x : y = m{x) . We denote by 
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ESj the data ES with deleted observation j . For any regressor subset S and any 
observation j we have a model m(S y j) obtained by applying the estimation 
procedure to subset S on data ES jt Applying this model to a data point x jf we 
obtain prediction yi(m(SJ)) . For any model m(SJ) and data set ES jt we define an 
5 Internal Quality Criterion, or a Quality Measure of the model: 

IC(m(S, j\ ESj ) = X ( j) f . (m(S, j)) - y, ) 2 

Algorithm for Variable Selection via Cross Model Validation 

1. For all /?and j , compute the best model M(p,j) by criterion IC ; 

2. For each /? = 1,...,P, compute the Cross Model Validation measure 
10 CMV(p) that accounts for the search for a best model for each size 

P- 

CMV(p) = 1 / ( yj (M(p, j)) - yj ) 2 

3. Calculate optimal model size p Q by minCMV(p) over all sizes p ; 

4. Compute the final best model of size p 0 . 

15 Objective Function Construction 

We consider the cases: single product vs. product group, regular price vs. clearance 
price, and simple period vs. composite period. 

Single Product 

Under the regular price optimization we have abundant inventory as opposed to 
20 scarce inventory under clearance price optimization, and undetermined (unlimited) 
target period. 
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Single Product — Regular Price — Simple Period 
Predicted revenue is computed as 

R = pexp(u T +a T 2 12) (23) 

that follows from the expression for mean of log-normal distribution. All the 
5 quantities in (23) are known or presumed to have been estimated except the price 
vector p that will play the part of control variables. Predicted mean log-demand u T 
is estimated as 

=gT T 9 + P& ( 24 ) 

The values of factors in the common factors vector g T are known or have been 
io estimated. Estimates y , n and a 2 have been obtained from regression fitting as 
described above. A variance estimate of log-demand for the target period (regular 
week) in formula (23) is computed as 

a 2 - a 2 i w T (25) 
where the weight w T is estimated as 

n 

15 "V=(Z w i) /w ( 26 ) 

Now the revenue optimization problem can be easily solved by maximizing the 
predicted revenue R in (23) in p under additional default or user-defined 

constraints on feasible prices. As a result, we will obtain the optimal revenue R reg 
and the optimal regular price p reg . Local maximization is done as follows: First, a 

20 neighborhood of the current (or initial) price p^ is defined. This neighborhood 

should either be restricted by the range of historically recorded prices, or it should 
allow limited extrapolation, say, at most 25% outside of that range. Revenue 
maximization will be done over the neighborhood, or probably over its subset if there 
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are additional a priori restrictions on prices. In other words, optimization of revenue 
will be done by searching for a local maximum of R reachable from p^ . Predicted 

sales volume (demand) V for a regular week may be computed by formula 

Ke g =exp(w r +o> 2 /2) 

5 In this formula for V reg , the mean log-demand u T is computed by formula (24) in 
which the optimal price p reg has been substituted for price p . 

Single Product — Regular Price — Composite Period 

The predicted revenue for a composite period is computed by formula 

io where each revenue R. for a subperiod i is computed by formula (23) 

R. = p exp(w, T + a iT 1 2) 

Similarly for predicted sales volumes (demands). Parameters a iT and a iT 2 are 

estimated for each component subperiod separately as described above for simple 
target period. 

15 Single Product — Clearance Price — Simple Period 
Predicted revenue is computed by formula 

R = pl-(p- s)KZ>(b T + o> ) + (p - s) exp(u r + o> 2 / 2)0(6 r ) (27) 

where u T is the predicted mean log-demand computed as in (24); 
b T =(log/-M r -<j t 2 )/<j t ; Gj is the predicted (estimated) variance of log-demand 
20 for the target period computed as in (25); The weight w r has been estimated; / is 
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the inventory level to be sold out during the target period provided on input; p is 
the clearance price to be optimized; s is the salvage price (0<s<p) provided on 
input; and O is the standard normal distribution function; values of common factors 
in the common factors vector g T are known or have been estimated; estimate n 
5 has been obtained from regression fit. 

The revenue optimization problem is solved by maximizing the predicted revenue R 
in (27) in p under additional constraints on feasible prices resulting in the optimal 

revenue R sale and the optimal price p sale . Predicted sales volume (demand) V salr may 

be computed by formula 

10 V sale = I - I0(b T + g t ) + exp(w r + a j 1 2)&(b T ) 

and the predicted leftover stock I lefi is computed by formula 

heft = 1 T -Vsale * V sale < ^ ™<* heft =0 Otherwise 

Single Product — Clearance Price — Composite Period 

For each component subperiod /, the quantities u iT , a iT 2 , w Ti and the rest are 

15 computed exactly as in 'Single Product Clearance Price — Simple Period' case 
above. However, we have no analytical counterpart of the formula (27) for the 
predicted revenue. Therefore we use Monte Carlo for calculating the values of the 
predicted revenue on a grid of price values, and then perform optimization. 

Multiple Products: Group (Category) Optimization 



20 



Estimation and optimization for a group of products simultaneously are much more 
difficult than for a single product. 
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Multiple Products — Regular Price — Simple Period 

The meaning of the common price design matrix is that the sales of each product 
depend on the category price vector rather than on its own price alone. The category 
total predicted revenue is computed by formula 

5 * = IX (28) 

where 

R d = P d exp(w jr + o d j 12) for d = (29) 

are individual product revenues. Here p = (p ]9 ...,p D ) T is the category price vector. 
The quantities u dT , f d , K d ,<J dT 2 , w dT are computed for each item d exactly as 

10 above in formulas (24)-(26). On substituting them into (28)-(29), we obtain R as a 
function of the category price vector: 

R = l J Pd^V^dT^^dT /2) 
d=\ 

Multiple Products - Regular Price -- Composite Period 

The situation here is essentially a combination of 'Single Product - Regular Price - 
15 Composite Period' case and 'Multiple Products - Regular Price - Simple Period' case. 

Multiple Products ~ Clearance Price - Simple Period 

We begin with the case when clearance sale is planned for a single particular product 
in a category. This problem is difficult because at the start of the clearance period, 
the clear-out product influences the sales of the regular products, but once a clear- 
20 out product has been sold out, the number of competing products is reduced 
immediately, which changes the demand equations for all remaining products in the 
category. 
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We consider a category of a size D containing products 

of which the first D-l are regular products while the last Y is a single clear-out 
product with known inventory / and clearance period T . Denote by y cumulative 

5 demand for product Y over the clearance period, and by z d cumulative demand of a 

regular product Z d over the same period. Denote also the demand for product Z d at 

day / by z dt (y) if product Y was on sale at day t , by z dt if product 7 was not on 

sale at that day, and by z dt (yl) if product Y was sold out during that day. The 

demand y for the clear-out product Y depends on the category price vector 

10 /? = (/?,,. .-,p D -i,p D ) T . The demand z dt for a regular product Z d at day / depends on 
the category price vector p if Y was on sale at that day, and depends on the first 
D-\ prices if Y was not on sale at that day. Define the sums 5, = y,, S 2 = y { +y 2 , 
S T = y } +j> 2 + ... + y T / so that we have 0 < S x < S 2 < ... < S T . Now define T + l 
non-overlapping and exhaustive events: e x = {/ < 5,} which says that product 7 was 

15 sold during day 1; e 2 = {S x < I < S 2 ) which says that product Y was on sale for the 

whole day 1, and was sold out during day 2; e T+] = {5 r </} which says that product 

7 was not sold out over the clearance period. Denoting by r(Y) and r(Z d ), 

d =1,...,Z>-1, the revenues from products 7 and Z d respectively during the 

clearance period, by r the category revenue, and by R the expected category 
20 revenue, we can write 

R = Er = E(r\e l ) Pr(e, ) + ... + E(r \ e T+l ) Pr(e r+1 ) (30) 
where for each / = + 1 , we have 



E{r\e l ) = E{r(Y)\e l ) + J u E{r(Z d )\e l ) 
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= p D E(y\e l ) + Y d p d E(z d \e l ) (31) 

Now using straightforward derivations we can calculate the values of E(y\e f ) and 
E{z d \e t ) for all t = + l , and also probabilities Pr(e,), substitute them into 

formulas (31) and (30), and obtain the expected category revenue R which could be 
5 optimized in category price vector p under appropriate constraints. 

The cases with multiple clear-out products are more involved technically but can be 
solved similarly. 

Multiple Products — Clearance Price — Composite Period 

The situation is similar to the previous case, but is still more complex because 
10 composite period introduces additional technical difficulties. 

Regression Modeling and Objective Function Construction 

The process is shown for a group of products in the flow-chart in Fig. 7. For each 
product in the group (Block 701), a regression model is constructed (Block 702), and 
fitting and estimation are performed (Block 703). If a 'clearance sale scenario' has 
15 been selected by the user (Block 704), a clearance objective function is selected for 
the given product in Block 705; otherwise, a regular objective function is selected in 
Block 706. When all individual objective functions have been constructed (Block 
707), the total group objective function is constructed in Block 708. 

Pricing Optimization 

20 For a group of products, optimization has to be done in multidimensional pricing 
space and therefore is much more difficult than for a single product. In this 
invention, this optimization is done by a proprietary software based on the method of 
feasible directions. Current values of the prices serve as initial values in the 
algorithm. 

25 Error Estimation and Significance Testing 
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There are two steps here: Estimation of prediction errors and of other measures of 
statistical precision, and significance testing of optimal revenue. 

Estimation of Prediction Errors and of Other Measures of Statistical Precision 

We use statistical bootstrap (Efron and Tibshirani, 1993) for estimating measures of 
5 statistical precision: prediction errors, standard deviations and biases. Bootstrap 
mimics real computations with theoretical values replaced by the observed values, 
and the real distribution replaced by a bootstrap distribution. In particular, 
regression model construction is replaced by bootstrapping the 'true' regression 
model, fitting and estimation is performed on each bootstrapped regression model, 
io and then optimization of revenue and computation of optimal price is performed for 
each set of bootstrapped estimates. As a result, the following estimates are 
computed: bootstrap estimate of prediction error for maximum revenue err Boot (R) , 

bootstrap estimate of revenue standard deviation sd Boot (R) f bootstrap estimate of 
bias of maximum revenue bia$ Boot (R). Similar quantities are computed for optimal 
15 prices. 

First we consider computations for a single product. Let R sale denote the optimal 

revenue, and R* b S aie the optimal revenue for a bootstrapped model b where 
6 = 1,.. .,B are bootstrapped replications. The following estimates are computed: 

The mean of bootstrapped maximum revenues 



B 



20 



R sale = b sak I B 



Bootstrap estimate of mean squared error for maximum revenue R 



sale 



B 



MSE Bo JR) = (£(R 



sale - R sale) 2 ) I B 



Bootstrap estimate of prediction error for maximum revenue R l 



sale 
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err Bool (R)^(MSE Bool (R)) m 
Bootstrap estimate of revenue variance (the bootstrap sample variance) 



vaf^W = (£(/?* 6 I0 , e -R\ a ,e) 2 )/(B-\) 

6=1 



Bootstrap estimate of revenue standard deviation (the bootstrap sample standard 
5 deviation) 

^^W = (var^(/?)) 1/2 
Bootstrap estimate of bias of R sale 

bias Boot (R)^R\aie-R sale 

Similar quantities are computed for an optimal price. 
10 Significance Testing of Optimal Revenue for a Single Product 

After an optimal predicted revenue R sale and an optimal price p sale have been 
calculated, it should be tested if the predicted revenue R sale is significantly different 
from the current revenue R^. In particular, it may happen that considerable 
changes in price translate into negligible revenue changes. 

15 We will be testing the hypothesis 
against the one-sided alternative 

H 1 : ER sale > Rcur 
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Note that is a smoothed value of current revenue computed as the predicted 
revenue at the current period. The test statistic is 

t = R sale ~&cur 

The bootstrap null distribution is 

5 t b = R b sale — R 5 ale 

for Z> = The value of the computed bootstrap p-value p\t) represents 

evidence for or against the null hypothesis H Q : for small p-values it should be 
rejected. 

Group Significance Testing 

10 Prediction errors, standard deviations and biases are estimated for all products in a 
group exactly as shown above for a single product. Group significance testing, 
however, is different. Firstly, we have a group total revenue and a vector of 
individual revenues, and while the group revenue can not go down after 
optimization, some of the individual revenues can. Secondly, testing individual 

15 revenues is done according to the theory of multiple hypothesis testing. Thirdly, 
there may be cases when the group total revenue increases significantly while some 
or even all individual revenue changes are insignificant. 

Flow-chart in Fig. 8 shows major computational steps of group significance testing. 
In Block 801, a bootstrap one-sided test for the group total revenue is performed, 

20 and if found insignificant, the procedure terminates with no price changes suggested 
(Block 806). If the test is significant, however, all individual revenue changes are 
tested by a bootstrap multiple testing procedure with one-sided individual tests in 
Block 802, and if no significant changes are found, the procedure terminates with no 
price changes suggested. If some of the individual revenue changes are significant, 

25 the corresponding prices are adjusted in Block 803, and a recalculation of optimal 
total revenue is performed in Block 804. The newly optimized prices are suggested 
as new prices in Block 805. 
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Promotion Scheduling Modeling 

The aim of promotion scheduling at any given time is construction of the next 
optimized schedule. At the Initial Period, scheduling is done by a non-statistical 
Initial Promotion Scheduling Procedure that produces random time grids for 
5 allocating promotion clips on the in-store monitors. Afterwards, scheduling is being 
performed by the Promotion Scheduling Procedure. 

Promotion Scheduling Procedure 

At a current time period A:, we do the following. First, we choose a set of best 
schedules from all the available (already constructed) schedules, then perform local 
10 regression modeling and optimization in the vicinity of each of them, then select the 
schedule with the most promising predicted revenue increase, and then test its 
significance. If it is, the local improvement is attempted; otherwise a new scheduling 
point close to already existing points is added. 

For any schedule corresponding to period s , we will be doing local regression fitting 
15 and estimation, and optimization of predicted revenue increase by maximizing linear 
revenue increment dR under corresponding restrictions. It can be written as 

dR = fXA dJ dx dJ (32) 

d=\ 7=1 

where 

20 R d are the predicted values of individual product revenues at the scheduling point 

X {5) , p d j are the promotion scheduling coefficient estimators, dx dj are the 

promotion time changes, i. e. increments in the scheduling space that are control 
variables in maximization. Each R d in (33) can be written as 



R d = p d exp(x T d fi d +a d ) (34) 
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where 

a d = g T y d +p r x d +a d 2 (35) 

Here x d is the schedule for product d at period s , g is the common factors vector 
at period s, p is the category price vector at period s f J3 d is the promotion 
5 scheduling coefficient estimate, y d is the common factors coefficient estimate, K d is 
the price coefficient estimate, and 6 d is the variance estimate. Before maximizing 
dR we have to know or to estimate these quantities. The quantities x d , g , p are 
known while the estimates J3 d , ;r rf and a d are to be selected. For y d and tt^ 
we take the most recently computed estimates, i. e. y d{k) and 7t d{k) . As for fi d and 

10 a d , they are computed by locally weighted regressions by using all the neighboring 
points of the scheduling point X {s) . Substituting these estimates into (35), (34), 
(33) and (32), we obtain dR that can be maximized in control variables dx d . . 

After local optimizations have been performed for all s , the corresponding revenue 
increases are compared, and the best scheduling point is selected and tested for 
15 statistical significance. If significant, the corresponding optimized promotion schedule 
is constructed; otherwise, a new scheduling point in the vicinity of the existing 
schedules is constructed with the purpose of increasing the density of the data points 
in the exploration region and giving a better chance of obtaining significant revenue 
increase at a later period. 

20 Promotion Scheduling Procedure: Algorithmic Description 

Promotion Scheduling Procedure containing 5 steps is described with reference to 
flow-chart in Fig. 9. In Block 901, all input parameters are entered. If the current 
period is the Initial Period (Block 902), the next schedule is computed by the Initial 
Promotion Scheduling Procedure (Block 903), otherwise Step 1 is performed: 
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Step 1 (Block 904). Choose a subset of best schedules from all available schedules. 

Step 1 is done by the Selection of Best Schedules Procedure that selects a best 
schedule subset containing the best schedule and all other schedules that lie witching 
a confidence set of the best schedule. 

5 Step 2 (Block 905). For all best schedules, fit regressions and compute regression 
estimates. 

Step 2 is performed by the Promotion Estimation Procedure that uses locally 
weighted regressions with data-driven rules for band-width selection. 

Step 3 (Block 906). For all best schedules, perform local promotion optimization and 
10 select the best predicted revenue. 

Step 4 (Block 907). Test whether the best predicted revenue gives significant 
increase. 

This is done by the Bootstrap Multiple Significance Test of Revenue Increase in 
Promotion Scheduling that performs multiple hypothesis testing based on bootstrap 
15 resampling for construction of null distributions. 

If the result at Step 4 was 'significant 1 , the best computed schedule is set as the next 
promotion schedule in Block 909; otherwise, Step 5 is performed: 

Step 5 (Block 908). Construct a new promotion schedule in the vicinity of the 
existing schedule points 

20 Construction of a new promotion schedule in the vicinity of the given schedule is 
done by the Construction of Close Promotion Schedule Procedure; it makes random 
steps from the center of the set of existing schedules in the scheduling space. 

Pricing-Promotion Estimation-Optimization Loop 

From equation (22) we see that for estimation of a pricing model with no promotions 
25 we have to fit the equations 
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d 



(36) 



while for estimation of a promotion model under constant prices we have to fit 



(37) 



Fitting of equations (36)-(37) is going to be done in a stepwise manner, say, daily, 
5 using the sales data in the historical database up to the current day. It will be 
assumed that at the very start, there are available sales data for some previous 
period, say, one or two years. On the other hand, it is not assumed that detailed 
data on promotional activities have been stored in the database. As a result, we can 
construct at the start large enough matrices G and P though not matrices X d . 

io Therefore, at the initial period prices could be estimated and optimized while 
promotion schedules would be generated by an Initial Promotion Scheduling 
Procedure irrespective of availability of historical data on promotion scheduling. After 
the initial period is over, promotion coefficients could be estimated by regression 
methods. 

15 The situation may be quite different for pricing optimization. Firstly, influence of 
prices on sales appears to be much stronger than that of promotion displays. 
Secondly, a historical database is likely to contain abundant and detailed price 
records related to sales. Thus, even at the start, we are likely to have enough 
reliable data on pricing to be able to fit corresponding regression equations. 

20 Therefore, for a considerable period of time, we may have to deal with the equation 
(36) rather than the equation (22). However, even after the Initial Period is over, 
and equation (22) can be estimated, it may still contain too many parameters, so 
that we may prefer to fit separately (36) and (37). Besides an opportunity to use 
different chunks of data for fitting equations (36) and (37), we can also use different 

25 estimation methods, say, linear regression for pricing and locally weighted regression 
for promotions. 

The estimation-optimization loop consists of Preparatory Step, Initial Period and Main 
Period (see Fig. 1). Here we describe the Initial and Main Periods. 
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Initial Period 

Fig. 10 shows a flow-chart of the Initial Period containing a finite number of 
iterations over a fixed time period that has default duration of one day but may be 
changed by the user. There are two major parts here: Pricing Leg and Promotion 
5 Scheduling Leg. After inputting parameters in Block 1001, the loop over time periods 
begins. First comes the Pricing Leg (Block 1002) containing Blocks 1003 to 1007. By 
mining the secondary database, Regression Model Construction/Update Procedure 
constructs regression model (36) in Block 1003. In Block 1004, the Separate Pricing 
Estimation Procedure fits equations (36) and obtains regression estimates. In Block 

io 1005, an objective function is selected according to input parameters and user- 
selected options. In Block 1006, pricing optimization is performed provided the user 
requested it. Regular or clearance category prices are computed depending on user's 
requests. Error analysis and significance testing of obtained prices are performed in 
Block 1007; probable errors in revenues and prices implied by uncertainty in the 

15 data are computed. Promotion Scheduling Leg starts in Block 1008. In Block 1009, 
Initial Promotion Scheduling Procedure Computes initial (non-statistical) promotion 
time allocation for a product category. Secondary database updating is done in Block 
1010. When the Initial Period is over (Block 1011), the control is passed to the Main 
Period in Block 1012, otherwise a new iteration begins. 

20 Main Period 

The Main Period contains First Pricing Leg, Promotion Scheduling Leg and Second 
Pricing Leg (Fig. 11), and allows a potentially infinite number of iterations. The First 
Pricing Leg (Blocks 1102 to 1107) functions the same way as the Pricing Leg at the 
Initial Period, the main difference being that the regression model here contains 

25 promotion part as well. Promotion Scheduling Leg (Blocks 1108 to 1110) contains 
modules for Regression Model Construction/Update (Block 1109) and Promotion 
Scheduling Procedure for obtaining a next promotion schedule(Block 1110). It either 
computes optimal promotion time allocation for a product category or adds a new 
close schedule in the scheduling space. Second Pricing Leg (Blocks 1111 to 1116) 

30 differs from the First Pricing Leg in that the influence of promoting enters into the 
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regression model, so that numerical results will in general be different. Secondary 
Database Updating is done in Block 1117, after which the next iteration starts. 

User's Interface Module 

For performing tasks like pricing optimization, sales forecasting or setting up 
5 promotion scheduling the system has to receive from the user a number of option 
selections, parameter and variable values. Those selections are arranged into three 
frameworks: Pricing Optimization Scenarios, Pricing Prediction Scenarios, and 
Promotion Scheduling Scenarios. In all scenarios, the user is presented with a series 
of screens on which he is requested to tic menu options, make selections and enter 

io input parameter values. Bv so doing, the user effectively creates a computation 
scenario to be executed bv the system at a later stage. After the user has made the 
required option selections and filled in the slots with parameter values, the Scenario 
Processor transforms the underlying scenario into the form suitable for feeding it into 
the computation engine, then requests from the computation engine to perform the 

15 necessary computations, and afterwards presents to the user a report that contains 
the results in an easy to read form. Pricing Optimization Scenarios 

Fig. 12 gives a schematic presentation of user-system interaction via price 
optimization scenarios. After selecting a price optimization scenario in Block 1201, 
the user modifies scenario properties by ticking menu options and making selections 

20 in Block 1202, and selects product categories, product groups and products in Block 
1203. When he activates the system in Block 1204, the computations begin. After 
the system's computation process is over, the user can look at the reports and 
analyze results in tabled and graphical forms in Block 1205. By default, the system 
saves scenarios complete with reports for a later review. If the results are deemed 

25 satisfactory, the user can print them if he so wishes in Block 1206. Alternatively, he 
can make changes in menu selections and input parameters and request a new 
round of computations. 

Pricing Optimization Scenarios contain the following options: 
* Select task - pricing 
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* Select goal - optimization 

* Select figure-of-merit, i. e. revenue or profit 

* Select grouping mode - individual or group 

* Select pricing mode - regular or clearance 

5 * Select product categories, product groups, and products 

* Select scenario name 

* Select execution mode - immediate or delayed 
Pricing Optimization Scenario Reports 

In the preferred embodiment, Pricing Optimization Scenario Reports contain the 
10 following major screens. 

Screen 1 presents the list of scenarios that have been calculated and saved for 
future reference. For constructing a new scenario, the user clicks Scenarios on the 
Menu Bar, and then on the drop-down menu clicks New Scenario. 

Screen 2 shows the Modify Scenario Properties menu. The user selects desired 
15 scenario properties (parameters) by checking the corresponding options: on the Goal 
pane he checks Optimization, on the Figure of Merit pane he checks Revenue which 
means that group revenue will be optimized, on the Grouping Mode pane he checks 
Group. 

Screen 3 presents a product selection menu; here the user has to select product 
categories and product groups in the selected categories. If he is interested only in 
particular products in the selected groups he can select them as well. All selections 
are done by ticking the desired items on the tree structure on the upper left pane or 
alternatively on the alphabetical list on the bottom pane. Then the user clicks Select 
Checked Items at the bottom and the desired items appear on the right half of the 
screen. 
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Screen 4, Save and Calculate, gives an option to select a name for our scenario and 
either to calculate it immediately or save it without calculation. Click on Go performs 
the calculations. 

After calculation, the scenario appears on Scenario List page under the given name. 
5 To review the results, the user goes to that page, clicks on the scenario, then clicks 
Review Scenario, and Scenario Reports. 

Screen 5 shows the Scenario Reports pane featuring a tree on the left. The tree 
provides pointers to key output data in the scenario reports that may be displayed in 
tables on the right. 

10 Clicking a group name node and then Product Pricing nodes opens 18 kinds of price 
indices and error indicators for all products in the selected group. Six of them are 
common indices, six refer to Unconditional Optimization, and six to Conditional 
Optimization. Common indices are Current Price, Lowest Price, Highest Price, 
Minimum Price, Maximum Price, and Base Price. Lowest Price and Highest Price are 

15 calculated for the available sales period. Minimum Price, Maximum Price and Base 
Price for a product are set by Administrator. Optimal Price, Price Change Price and 
Change % are calculated by the System using prediction and optimization models 
based on the historical sales data. The remaining indices relate to conditional 
optimization when some of the prices in the group are not allowed to be changed by 

20 the user. 

Screen 6 shows the prices in graphical form. Three kinds of prices are graphically 
compared for all products in the group. Visual representation provides easier 
comprehension and price comparisons but lacks statistical indices given in the tables. 

Screen 7 shows the Scenario Reports for product revenues. Expanding the Group 
25 Revenues node shows 13 revenue indices and error indicators on the right. The 
Current Revenue is a common index, six indices refer to Unconditional Optimization, 
and six to Conditional Optimization. The meanings of the indices are similar to those 
for prices. 
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Pricino Prediction Scenarios 

In contrast to optimization scenarios, here the main output is forecasts based on 
user-suggested prices. To request a forecast, the user defines a product, or a 
product category, or a group in a category, the corresponding prices (which may be 
5 current prices or any prices of his choosing), the figure of merit to be forecasted, and 
a target period for forecasting. Then the Scenario Processor transforms the 
underlying scenario into the form suitable for feeding it into the computation engine, 
then requests from the computation engine to perform the necessary computations, 
and afterwards presents to the user a report that contains the results in an easy to 
10 read form. 

Fig. 13 gives a schematic presentation of user-system interaction via price prediction 
scenarios. After selecting a price prediction scenario in Block 1301, the user modifies 
scenario properties in Block 1302, and selects product categories, product groups 
and products in Block 1303. When he activates the system in Block 1304, the 

15 computations begin. After the system's computation process is over, the user can 
look at the reports and analyze results in tabled and graphical forms in Block 1305. 
By default, the system saves scenarios complete with reports for a later review. If 
the results are deemed satisfactory, he can print them if he so wishes in Block 1306. 
Alternatively, he can make changes in menu selections and input parameters and 

20 request a new round of computations. 

Pricing Prediction Scenarios contain the following options: 

* Select task - pricing 

* Select goal - forecasting 

* Select grouping mode - individual or group 
25 * Select pricing mode - regular or clearance 

* Select product categories, product groups, and products 

* Select scenario name 
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* Select execution mode - immediate or delayed 
Pricing Prediction Scenario Reports 

In the preferred embodiment, Pricing Prediction Scenario Reports contain the 
following major screens. 

5 Screen 1 presents the list of scenarios that have been calculated and saved for 
future reference. For constructing a new scenario, the user clicks Scenarios on the 
Menu Bar, and then on the drop-down menu clicks New Scenario. 

Screen 2 shows the Modify Scenario Properties menu. The user selects desired 
scenario properties (parameters) by checking the corresponding options: on the Goal 
io pane he checks Forecasting, on the Grouping Mode pane he checks Group. 

Screen 3 presents us with a product selection menu; here the user has to select 
product categories and product groups in the selected categories. If he is interested 
only in particular products in the selected groups he can select them as well. All 
selections are done by ticking the desired items on the tree structure on the upper 
15 left pane or alternatively on the alphabetical list on the bottom pane. Then he clicks 
Select Checked Items at the bottom and the desired items appear on the right half of 
the screen. 

Screen 4, Save and Calculate, gives an option to select a name for the scenario and 
either to calculate it immediately or save it without calculation. Click on Go performs 
20 the calculations. 

After calculation, the scenario appears on the Scenario List page under the given 
name. To review the results, the user goes to that page, clicks on the scenario, then 
clicks Review Scenario, and Scenario Reports. 

Screen 5 shows the Scenario Reports pane featuring a tree on the left. The tree 
25 provides pointers to key output data in the scenario reports that may be displayed in 
tables on the right. Clicking a group name node and then Product Pricing nodes 
opens 9 kinds of price indices for all products in the selected group. Those include 
Current Price, Suggested Price, Price Change, Price Change%, Lowest Price, Highest 
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Price, Minimum Price, Maximum Price, and Base Price. Lowest Price and Highest Price 
are calculated for the available sales period. Minimum Price, Maximum Price and 
Base Price for a product are set by Administrator. 

Screen 6 shows the prices in graphical form. Visual representation provides easier 
5 comprehension and price comparisons but lacks statistical indices given in the tables. 

Screen 7 shows the Scenario Reports for product revenues. Expanding the Group 
Revenues node shows 7 revenue indices on the right. The meanings of the indices 
are similar to those for prices. 

Promotion Scheduling Scenarios 

io Promotion Scheduling module differs from the Pricing Optimization module and 
Pricing Prediction module in that it constitutes_an__autonomous block_within the 
system that functions continuously, in general without user's intervention, and 
changes promotion schedules according to predefined time frames. The ongoing 
consequences of promotion optimization are absorbed into the database in the form 

15 of demand changes without providing any reports or messages to the user unless 
and until the user explicitly requested them or expressed his desire to introduce 
modifications into promotion schedules in the form, say, of restricting demonstration 
times of certain clips on certain monitors. In such a case, he will be presented with a 
number of options for modifying promotion scheduling or reports as the case may 

20 be. Of course, if the user decided to introduce modifications into the promotion 
network, the ensuing promotion schedules will not be optimized in the original sense, 
but will be only conditionally optimal. 

Fig. 14 gives a schematic presentation of user-system interaction via promotion 
scheduling scenarios. After selecting a promotion scheduling scenario in Block 1401, 
25 he modifies scenario properties in Block 1402, selects product categories, product 
groups and products in Block 1403, and selects promotion clips he wishes to run on. 

the in-store monitors for promotion purposes in Block 1404. When he activates the 
system in Block 1405, the computations begin. After the system's computation 
process is over, the user can look at the reports and analyze results in tabled and 
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graphical forms in Block 1406. If the results are deemed satisfactory, he can save or 
print the scenario results if he so wishes in Block 1407. Alternatively, he can modify 
product selections and clip sets in Block 1408, and request a new round of 
computations. 

5 Promotion Scheduling Scenarios contain the following options: 

* Select task - promotion 

* Select goal - optimization 

* Select grouping mode - individual or group 

* Select pricing mode - regular or clearance 

10 * Select product categories, product groups, and products 

* Select scenario name 

* Select execution mode - immediate or delayed 
Promotion Scheduling Scenario Reports 

In the preferred embodiment, Promotion Scheduling Scenario Reports contain the 
15 following major screens. 

Screen 1 presents the list of scenarios that have been calculated and saved for 
future reference. For constructing a new scenario, the user clicks Scenarios on the 
Menu Bar, and then on the drop-down menu clicks New Scenario. 

Screen 2 shows the Modify Scenario Properties menu. The user selects desired 
20 scenario properties (parameters) by checking the corresponding options: on the Goal 
pane he checks Promotion Scheduling, on the Figure of Merit pane he checks 
Revenue which means that group revenue will be optimized, on the Grouping Mode 
pane he checks Group. 

Screen 3 presents a product selection menu; here the user has to select product 
25 categories and product groups in the selected categories. If he is interested only in 
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particular products in the selected groups he can select them as well. All selections 
are done by ticking the desired items on the tree structure on the upper left pane or 
alternatively on the alphabetical list on the bottom pane. Then the user clicks Select 
Checked Items at the bottom and the desired items appear on the right half of the 
5 screen. 

Screen 4 presents a promotion clip selection menu; here the user has to select 
promotion clips he wishes to run on the in-store monitors for promotion purposes. If 
he is interested only in promoting particular products or groups he can select them 
as well. All selections are done by ticking the desired items on the tree structure on 
10 the upper left pane or alternatively on the alphabetical list on the bottom pane. 

Screen 5, Save and Calculate, gives an option to select a name for our scenario and 
either to calculate it immediately or save it without calculation. Click on Go performs 
the calculations. 

After calculation, the scenario appears on Scenario List page under the given name. 
15 To review the results, the user goes to that page, clicks on the scenario, then clicks 
Review Scenario, and Scenario Reports. 

Screen 6 shows the Scenario Reports pane featuring a tree on the left. The expected 
sales from optimized promotion may be displayed in tables on the right. 

Screen 7 shows the expected sales from optimized promotion in graphical form. 
20 Visual representation provides easier comprehension and price comparisons but lacks 
statistical indices given in the tables. 



Screen 8 allows the user to make changes in product selections and clip sets used in 
promotion. 



